Replay apparatus, signal processing apparatus, and signal processing method

ABSTRACT

A method of selectively performing signal processing in a first mode and in a second mode. In the first mode, a noise cancel signal having a signal characteristic to cancel an external noise component is generated based on a voice signal supplied from a microphone, and an input digital audio signal and the noise cancel signal are combined into a voice signal to be output through a speaker. In the second mode, a sound process for vocal voice is performed on a voice signal supplied from a microphone, a vocal voice component is canceled from a digital audio signal of input music to generate a karaoke signal, and the karaoke signal and the vocal signal are combined into a voice signal to be output through a speaker. The first mode corresponds to an audio replay operation accompanied by noise cancel, and the second mode corresponds to a karaoke operation.

BACKGROUND

The present disclosure relates to a replay apparatus, a signalprocessing apparatus, and a signal processing method and, moreparticularly, to a technology for selectively performing a music replayoperation and a karaoke operation.

PRIOR ART Patent Document

-   [Patent document 1] Patent Application Publication No. 2001-34277

A lot of general users enjoy the benefits of listening to music by meansof portable audio players. It is well known that an audio playerincluding an earphone equipped with a microphone may allow users toenjoy listening to music with less noise in a noisy situation byobtaining a signal of an opposite phase to that of an external noisesignal collected through the microphone and adding the obtained signalto an audio signal. In addition, a lot of general users may beentertained with karaoke.

SUMMARY

A portable audio player and a karaoke system are completely differentfrom each other. Accordingly, such a place as home has to be equippedwith a karaoke system for a user to enjoy karaoke. The presenttechnology is conceived to provide a replay apparatus, such as aportable audio player, which enables users to conveniently enjoy karaokeat any time.

According to an embodiment of the present disclosure, there is provideda replay apparatus which includes a music source unit configured tooutput a digital audio signal of music; a microphone signal input unitconfigured to input a voice signal supplied from a microphone; a noisecancel signal generating unit configured to generate a noise cancelsignal having a signal characteristic to cancel an external noisecomponent based on the voice signal input by the microphone signal inputunit; a vocal processing unit configured to perform a sound process forvocal voice on the voice signal input by the microphone signal inputunit to generate a vocal signal; a karaoke signal generating unitconfigured to generate a karaoke signal by canceling a vocal voicecomponent from the digital audio signal supplied from the music sourceunit; a combination unit configured to perform a first combining processwhere the digital audio signal supplied from the music source unit andthe noise cancel signal are combined, and a second combining processwhere the karaoke signal and the vocal signal are combined; a controlunit configured to control the combination unit to perform the firstcombining process in a first mode and to perform the second combiningprocess in a second mode; and an output unit configured to output thecombined signals combined by the combination unit as a voice signal tobe output from a speaker. For example, the noise cancel signalgenerating unit, the vocal processing unit, the karaoke signalgenerating unit and the combination unit may be installed as softwareprocessing functions in an operation processing device. Further, thecontrol unit may controls the operation processing device to execute thenoise cancel signal generating unit and to execute the combination unitto perform the first combining process in the first mode, and thecontrol unit may control the operation processing device to execute thevocal processing unit and the karaoke signal generating unit and toexecute the combination unit to perform the second combining process inthe second mode.

According to another embodiment of the present disclosure, there isprovided a signal processing apparatus which includes a microphonesignal input unit configured to input a voice signal supplied from amicrophone; a noise cancel signal generating unit configured to generatea noise cancel signal having a signal characteristic to cancel anexternal noise component based on the voice signal input by themicrophone signal input unit; a vocal processing unit configured toperform a sound process for vocal voice on the voice signal input by themicrophone signal input unit to generate a vocal signal; a karaokesignal generating unit configured to generate a karaoke signal bycanceling a vocal voice component from the digital audio signal of inputmusic; a combination unit configured to perform a first combiningprocess where the input digital audio signal and the noise cancel signalare combined, and a second combining process where the karaoke signaland the vocal signal are combined; a control unit configured to controlthe combination unit to perform the first combining process in a firstmode and to perform the second combining process in a second mode; andan output unit configured to output the combined signals combined by thecombination unit as a voice signal to be output from a speaker.

According to another embodiment of the present disclosure, there isprovided a method of selectively performing signal processing in a firstmode and in a second mode. Further, in the first mode, a noise cancelsignal having a signal characteristic to cancel an external noisecomponent is generated based on a voice signal supplied from amicrophone, and an input digital audio signal and the noise cancelsignal are combined into a voice signal to be output through a speaker;and in the second mode, a sound process for vocal voice is performed ona voice signal supplied from a microphone, a vocal voice component iscanceled from a digital audio signal of input music to generate akaraoke signal, and the karaoke signal and the vocal signal are combinedinto a voice signal to be output through a speaker.

The present technology is conceived to provide a replay apparatus, suchas a portable audio player, which enables such an operation processingdevice as a digital signal processor (DSP) configured to perform digitalaudio signal processing (particularly, noise cancel processing) to beconverted to perform karaoke signal processing. Accordingly, a user mayuse the replay apparatus to listen to music in a first mode and to serveas a karaoke system in a second mode. Since such a configuration mayonly be accomplished by changing internal process of the operationprocessing device, no hardware has to be added. Further, a microphonewith noise cancel function, such as a microphone installed in anearphone unit, may be used as a vocal microphone for karaoke. Inaddition, in the second mode for karaoke, the operation processingdevice may also perform a variety of sound processes for vocal voice.

Effects of the Invention

The present technology enables users to use a replay apparatus to listento music as well as to conveniently enjoy karaoke.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a view illustrating an audio player according to an exemplaryembodiment of the present disclosure;

FIG. 2 is a view illustrating an audio player which is used to implementa karaoke system according to an exemplary embodiment of the presentdisclosure;

FIG. 3 is a view illustrating an audio player which is used to implementa karaoke system according to an exemplary embodiment of the presentdisclosure;

FIG. 4 is a view illustrating an audio player which is used to implementa karaoke system according to an exemplary embodiment of the presentdisclosure;

FIG. 5 is a block diagram illustrating an audio player according to anexemplary embodiment of the present disclosure;

FIG. 6 is a view illustrating a digital signal processor (DSP) of anaudio player for processing signals in a noise cancel (NC) mode and akaraoke mode according to an exemplary embodiment of the presentdisclosure;

FIG. 7 is a view illustrating an NC signal generating unit and a vocalprocessing unit in an audio player according to an exemplary embodimentof the present disclosure;

FIG. 8 is a view illustrating a DSP process according to an exemplaryembodiment of the present disclosure;

FIG. 9 is a view illustrating a DSP process according to an exemplaryembodiment of the present disclosure;

FIG. 10 is a view illustrating a signal process in operating in karaokemode with an earphone mounted according to an exemplary embodiment ofthe present disclosure;

FIG. 11 is a view illustrating a DSP process in operating in karaokemode with an earphone mounted according to an exemplary embodiment ofthe present disclosure;

FIG. 12 is a block diagram illustrating a beamforming unit of a vocalprocessing unit according to an exemplary embodiment of the presentdisclosure;

FIG. 13 is a view illustrating mid-presence filter (MPF) characteristicsof a beamforming unit according to an exemplary embodiment of thepresent disclosure; and

FIG. 14 is a view illustrating a noise cancel unit according to anexemplary embodiment of the present disclosure.

DETAILED DESCRIPTION OF THE EMBODIMENT(S)

Hereinafter, preferred embodiments of the present disclosure will bedescribed in detail with reference to the appended drawings. Note that,in this specification and the appended drawings, structural elementsthat have substantially the same function and structure are denoted withthe same reference numerals, and repeated explanation of thesestructural elements is omitted.

Further, preferred embodiments of the present disclosure will bedescribed in the following order. In the appended claims, a replayapparatus according to an embodiment of the present disclosure is aportable audio player, and a signal processing apparatus is incorporatedin an audio player according to an embodiment of the present disclosure.

<1. Example of audio player operating in karaoke mode>

<2. Structure of audio player>

<3. Signal processing in NC mode and karaoke mode>

<4. Processing examples of DSP>

<5. Modified examples>

1. Example of Audio Player Operating in Karaoke Mode

A portable audio player according to an embodiment of the presentdisclosure may be used for a user to replay and enjoy music and may alsobe used as a karaoke system.

FIG. 1 is a view illustrating an audio player 1 according to anembodiment of the present disclosure. The audio player 1 includes areplay unit and a digital signal processor (DSP) for signal processing,which are encased in a small portable case, to output a voice signal.The audio player 1 further includes a display unit 14 and an operatingpart 12 a. The audio player 1 is typically used with an earphone device2. The earphone device 2 includes L- and R-channel speaker units 2L, 2R,a cord 2 b, and a plug 2 c. A user listens to music replayed from theaudio player 1 by connecting the plug 2 c of the earphone device 2 to ajack 19 of the audio player 1 and wearing the speaker units 2L, 2Rinside his/her ears. The present embodiment illustrates an earphonewhich the user wears inside his/her ears. However, a headphone which theuser wears over his/her ears may be used.

As described below, the speaker units 2L, 2R are equipped withmicrophones to collect external noises. The audio player 1 provides auser with a replayed voice with a reduced noise by generating a noisecancel signal based on a voice signal collected by the microphones andadding the noise cancel signal to an audio signal. In the presentdisclosure, “noise cancel” may be denoted by “NC.”

The user may use the audio player 1 to enjoy karaoke in the followingmanner. FIG. 2A is a view illustrating the audio player 1 which is usedby a user to implement a karaoke system by connecting the audio player 1to an external amplifier 4. The audio player 1 has an externalconnection terminal (not shown) by which to connect to the amplifier 4.The amplifier 4 is connected to speakers 5, 5. In this case, a voicesignal output from the audio player 1 is output as voice through thespeakers 5, 5.

The user uses the microphones installed in the speaker units 2L, 2R ofthe earphone device 2, i.e., microphones, which are usually used tocollect external noises, are used as vocal microphones. Further, theaudio player 1 generates a karaoke signal from audio data of a replayedmusic by performing a vocal cancel process on the audio data of areplayed music. Further, the audio player 1 mixes the karaoke signalwith a microphone input voice signal which is the voice of a singinguser. The audio player 1 supplies the mixed voice signal to theamplifier 4. The mixed voice signal is output from the speakers 5, 5. Assuch, the user may sing to a music (karaoke music) from the externalspeakers 5, 5 and the singing voice may also be output from the speakers5, 5, thereby implementing a karaoke system.

Instead of the microphone for noise collection of the earphone device 2illustrated in FIG. 2A, for example, a mono microphone 3M illustrated inFIG. 2B or a stereo microphone 3S illustrated in FIG. 2C may beconnected to the jack 19 of the audio player 1. In this case, the usermay hold the audio player 1 and sing a song using the audio player 1 asa vocal microphone.

FIG. 3 is a view illustrating the audio player 1 which is used toimplement a karaoke system by connecting the audio player 1 to themonitor 6 (e.g., television set or personal computer monitor). Themonitor 6 is equipped with a display 6D and speakers 6S, 6S. In FIG. 3,the audio player 1 is mounted on a cradle 7 through which the monitor 6is connected in wired or wireless manner for data communication. Theaudio player 1 supplies output audio data through the externalconnection terminal to the cradle 7. The cradle 7 transmits the audiodata to the monitor 6. Further, the audio player 1 is connected to theearphone device 2 so that two users may use microphones for noisecollection, which are installed in the speaker units 2L, 2R, as vocalmicrophones. While the earphone device 2 may be used for one user, theearphone device 2 may be used for two users singing a duet since theearphone device 2 is operated in stereo.

The audio player 1 generates a karaoke signal from audio data of areplayed music by performing a vocal cancel process on the audio data ofa replayed music. Further, the audio player 1 mixes the karaoke signalwith a microphone input voice signal, which is the voice of a singinguser. The mixed voice signal is supplied to the monitor 6 through thecradle 7 and is output through the speakers 6S, 6S. As such, the usermay sing to the music (i.e., karaoke music) from the speakers 6S, 6S ofthe monitor 6 and the user's singing voice may also be output from thespeakers 6S, 6S, thereby implementing a karaoke system.

On the other hand, the audio player 1 may be equipped with a function ofdisplaying lyrics which are accompanied by the replayed music. In thiscase, lyrics data corresponding to the audio data of the replayed musicmay also be configured to be supplied to the monitor 6. In this case,since the lyrics may be displayed on the display 6D of the monitor 6,this is suitable for the karaoke. Further, in FIGS. 2A and 2B, thelyrics may be displayed on the display unit 14 of the audio player 1.

FIG. 4 is a view illustrating the audio player 1 and the earphone device2 which are used to implement a karaoke system. The user may wear thespeaker units 2L, 2R of the earphone device 2 so that he/she may listento voice output from speakers 21L, 21R in the speaker units 2L, 2R. Thespeaker units 2L, 2R are equipped with microphones 22L, 22R for noisecollection. The microphones 22L, 22R may be used as vocal microphones.In this case, while the vocal microphones are placed around the user'sears rather than around the user's mouth, the user's voice may becertainly collected by performing the following beamforming process onthe audio player 1.

The audio player 1 generates a karaoke signal from audio data of areplayed music by performing a vocal cancel process on the audio data ofa replayed music. Further, the audio player 1 mixes the karaoke signalwith a voice signal input from the microphones 22L, 22R, which is thevoice of a singing user. The mixed voice signal is output from thespeakers 21L, 21R. As such, the user may listen to karaoke music withthe earphone device 2 and sing to the karaoke music while listening tohis/her singing voice with the earphone device 2, thereby simplyimplementing a karaoke system.

2. Structure of Audio Player

A structure of the audio player 1 according to an embodiment of thepresent disclosure which may be used as a karaoke device in theabove-mentioned manner will be described with reference to FIG. 5. FIG.5 illustrates that the earphone device 2 is connected. The speaker units2L, 2R of the earphone device 2 are equipped with the speakers as wellas the microphones for external noise collection, as described above.That is, as shown in FIG. 5, the speaker unit 2L includes the speaker21L and the microphone 22L, while the speaker unit 2R includes thespeaker 21R and the microphone 22R. The earphone device 2 and the audioplayer 1 are electrically connected to each other as shown in FIG. 5 bythe contact between the plug 2 c and the jack 19 which are shown in FIG.1.

As shown in FIG. 5, the audio player 1 includes a replay unit 10, acontrol unit 11, an operating unit 12, a display controller 13, adisplay unit 14, an external communication unit 15, a DSP 16, amicrophone input unit 17, and an earphone output unit 18.

The replay unit 10 is a music source unit outputting a digital audiosignal such as music. The replay unit 10 includes a recording medium forstoring, for example, music content thereon, and a decoder for decodingdata of the music content read from the recording medium. Examples ofthe recording medium may include solid-state memory, such as flashmemory, and a hard disc drive (HDD). Further, instead of such a built-inrecording medium, examples of the recording medium may include a drivecorresponding to a removable recording medium, such as a memory cardequipped with solid-state memory, an optical disc, such as a compactdisc (CD) or a digital versatile disc (DVD), a magneto-optical disc, orhologram memory. It should be understood that both the built-in memory,such as solid-state memory or HDD, and the drive for the removablerecording medium may be mounted on the replay unit 10. For example, sucha recording medium has such data as music content encoded by a voiceencoding technique. The replay unit 10 decodes the coded data of musiccontent read from the recording medium and outputs digital audio signalsDaL, DaR as, for example, L- and R-channel linear PCM data, to the DSP16. The replay unit 10 may receive digital audio signals transmitted inwireless or wired manner from external devices, and output the digitalaudio signals DaL, DaR as L- and R-channel linear PCM data.

Voice signals collected by the microphones 22L, 22R of the earphonedevice 2 is input to the audio player 1 through the microphone inputunit 17. The voice signal input through the microphone 22L is amplifiedby the microphone amplifier 32L and converted into a digital signal bythe A/D converter 31L. The voice signal input through the microphone 22Ris amplified by the microphone amplifier 32R and converted into adigital signal by the A/D converter 31R. The voice signals convertedinto the digital signals (hereinafter referred to as “microphone inputsignals SmL, SmR) are supplied to the DSP 16.

The DSP 16 performs appropriate operations on the digital audio signalsDaL, DaR which are supplied from the replay unit 10. Further, the DSP 16performs appropriate operations, such as noise cancel process, on thevoice signals (microphone input signals SmL, SmR) which are inputthrough the microphone input unit 17. The DSP 16, which is a processorimplemented by software, includes an audio processing unit 16 a, a noisecancel signal generating unit (hereinafter referred to as “NC signalgenerating unit”) 16 b, a vocal processing unit 16 c, a karaoke signalgenerating unit 16 d, and a combination unit 16 e.

The audio processing unit 16 a performs operations, such as equalizationor gain adjustment, on the digital audio signals DaL, DaR to be outputto the earphone device 2. The equalizing operation includes soundquality correction, such as amplitude-frequency characteristiccorrection and/or phase-frequency characteristic correction. The gainadjusting operation performs volume amplification or volume limitationfor the digital audio signals DaL, DaR.

The NC signal generating unit 16 b generates noise cancel signals, whichhave a signal characteristic of canceling external noise components,based on the microphone input signals SmL, SmR which are input from themicrophone input unit 17. In other words, the NC signal generating unit16 b generates a signal of an opposite phase to that of an externalnoise component which is collected by the microphones 22L, 22R.

The vocal processing unit 16 c processes the microphone input signalsSmL, SmR input from the microphone input unit 17 into vocal signals byperforming a sound process on the microphone input signals SmL, SmR tobe suitable for vocals. The karaoke signal generating unit generates akaraoke signal with no or less vocal sound by canceling vocal voicecomponents from the digital audio signals DaL, DaR. The combination unit16 e performs a first combining process to combine the digital audiosignal processed by the audio processing unit 16 a and the noise cancelsignal generated by the NC signal generating unit 16 b. Further, thecombination unit 16 e performs a second combining process to combine thekaraoke signal generated by the karaoke signal generating unit 16 d andthe vocal signal from the vocal processing unit 16 c.

The signals processed by the DSP 16, i.e., the signals combined by thecombination unit 16 e are supplied as the output signals SsL, SsR to theearphone output unit 18. The output signal SsL is converted into ananalog signal by a D/A converter 33L, amplified by a power-up 34L,supplied to a speaker 21L and output as a sound. The output signal SsRis converted into an analog signal by a D/A converter 33R, amplified bya power-up 34R, supplied to a speaker 21R and output as a sound. Thatis, the earphone output unit 18 outputs the output signals SsL, SsR fromthe DSP 16 as voice signals to the speakers 21L, 21R. Further, theearphone output unit 18 may be configured to perform operations asdigital amplifiers.

The external communication unit 15 establishes communication withexternal devices, such as the amplifier 4 of FIG. 2, the monitor 6 orthe cradle 7 of FIG. 3, in wired or wireless manner. The output signalsSsL, SsR output from the DSP 16 may be transmitted to external devicesthrough the external communication unit 15. That is, the externalcommunication unit 15 outputs the output signals SsL, SsR from the DSP16 to external speakers as output voice signals.

The operating unit 12 and the display unit 14 are provided for userinterface. The operating unit 12 detects, for example, the user'soperation on the operating part 12 a or the touch panel in FIG. 1 andsupplies the operation information to the control unit 11. The displayunit 14 includes a liquid crystal panel or an organicelectroluminescence (EL) panel and displays a variety of informationunder the control of the display controller 13. For example, the displayunit 14 is configured to display replay operations, replayed musiccontent, or messages.

The control unit 11 includes a microcomputer (CPU: central processingunit) and controls each component according to programs and the user'soperations on the operating unit 12 to output audio signals.Specifically, the control unit 11 controls the output of the digitalaudio signals DaL, DaR in the replay unit 10 or the processes of the DSP16. Further, the control unit 11 instructs the display controller 13 todisplay operating information on the display unit 14 according tooperating conditions. Further, the control unit 11 may establishcommunication with external devices through the external communicationunit 14.

In particular, in the present embodiment, the control unit 11 maycontrol the DSP 16 to be switched to the NC (noise cancel) mode or tothe karaoke mode so that the DSP 16 may be operated accordingly.Specifically, the control unit 11 controls the combination unit 16 e ofthe DSP 16 to perform the first combining process for the NC mode, orcontrols the combination unit 16 e of the DSP 16 to perform the secondcombination process for the karaoke mode. This process will be describedin detail.

The replay unit 10 may replay music content accompanied by lyrics dataof the music content. In this case, the lyrics data is supplied to thedisplay controller 13, and the display controller 13 controls thedisplay unit 14 to display the lyrics data. Further, the displaycontroller 13 may transmit the lyrics data as display data to externaldevices through the external communication unit 15. For example, in theembodiment as shown in FIG. 3, the display controller 13 may control thedisplay 6D of the monitor 6 to display the lyrics data.

3. Signal Processing in NC Mode and Karaoke Mode

The audio player 1 thus configured may perform operations in the NC modeand the karaoke mode by the user's operation. Specifically, for example,if the user selects one of the modes through the operating unit 12, thecontrol unit 11 controls the DSP 16 to perform the NC mode or thekaraoke mode.

The process of the DSP 16 in each of the modes will be described withreference to FIG. 6. As described above, the DSP 16 serving as anoperation processing device may perform operation processes with theaudio processing unit 16 a, the NC signal generating unit 16 b, thevocal processing unit 16 c, the karaoke signal generating unit 16 d, andthe combination unit 16 e based on software programs. The operationprocesses are controlled by the control unit 11.

FIG. 6A illustrates a process flow of the DSP 16 when the control unit11 instructs the DSP 16 to perform an operation in the NC mode. In thiscase, the audio processing unit 16 a, the NC signal generating unit 16 band the combination unit 16 e are executed. Specifically, the audioprocessing unit 16 a performs equalizing process or gain adjustingprocess on the digital audio signals DaL, DaR which are supplied fromthe replay unit 10. After the process is completed, the audio processingunit 16 a supplies the processed digital audio signal DaL′ to an adder16 eL of the combination unit 16 e and supplies the processed digitalaudio signal DaR′ to an adder 16 eR of the combination unit 16 e.

The NC signal generating unit 16 b generates noise cancel signals SncL,SncR based on the microphone input signals SmL, SmR from the microphoneinput unit 17. Next, the NC signal generating unit 16 b supplies thenoise cancel signal SncL based on the microphone input signal SmL to theadder 16 eL of the combination unit 16 e and supplies the noise cancelsignal SncR based on the microphone input signal SmR to the adder 16 eRof the combination unit 16 e.

The adder 16 eL of the combination unit 16 e adds the digital audiosignal DaL′ and the noise cancel signal SncL into an output signal SsL.Further, the adder 16 eR of the combination unit 16 e adds the digitalaudio signal DaR′ and the noise cancel signal SncR into an output signalSsR.

For the NC mode thus processed by the DSP 16, the sound of music contentreplayed by the replay unit 10 is output from the speakers 21L, 21R. Asa result, the user may listen to the sound and, at the same time, may beprovided with comfortable music with reduced noises.

FIG. 6B illustrates a process flow of the DSP 16 when the control unit11 instructs the DSP 16 to perform an operation in the karaoke mode. Inthis case, the audio processing unit 16 a, the karaoke signal generatingunit 16 d, the vocal processing unit 16 c and the combination unit 16 eare executed. Specifically, the audio processing unit 16 a performsequalizing process or gain adjusting process on the digital audiosignals DaL, DaR which are supplied from the replay unit 10. After theprocess is completed, the audio processing unit 16 a supplies theprocessed digital audio signals DaL′, DaR′ to the karaoke signalgenerating unit 16 d. The karaoke signal generating unit 16 d performs,for example, vocal cancel process to generate karaoke signals SkL, SkR(signals with no or less vocal level). Next, the karaoke signalgenerating unit 16 d supplies the L- and R-channel karaoke signals SkL,SkR to the adders 16 eL, 16 eR of the combination unit 16 e.

The vocal processing unit 16 c performs a sound process on themicrophone input signals SmL, SmR from the microphone input unit 17 tobe suitable for vocals. After the sound process is completed, the vocalprocessing unit 16 c supplies the processed L- and R-channel signals(vocal signals SvL, SvR) to the adders 16 eL, 16 eR of the combinationunit 16 e.

The adder 16 eL of the combination unit 16 e adds the L-channel karaokesignal SkL and the vocal signal SvL into an output signal SsL. Further,the adder 16 eR of the combination unit 16 e adds the R-channel karaokesignal SkR and the vocal signal SvR into an output signal SsR.

For the karaoke mode thus processed by the DSP 16, the karaoke soundfrom the music content replayed by the replay unit 10 from which thevocal voice is removed is output from the speakers 21L, 21R and, at thesame time, the voice of the singing user is output as the vocal voicefrom the speakers 21L, 21R. In other words, the karaoke operation isperformed as illustrated in FIG. 4. Further, by transmitting the outputsignals SsL, SsR from the DSP 16 to the external devices through theexternal communication unit 15, the karaoke operation is performed asshown in FIG. 2 or 3. In this case, the microphones 3M, 3S may beconnected to the audio player 1 as shown in FIG. 2B or 2C. In this case,the microphone input signals SmL, SmR input from the microphone inputunit 17 to the DSP 16 may be voice signals collected by the microphones3M, 3S.

As described above, the audio player 1 may be configured to switch fromthe music replay mode to the karaoke mode or vice versa by only changingthe signal process in the DSP 16. The user may simply perform a modeselect operation to select the music replay mode or the karaoke mode andenjoy music or karaoke. For the karaoke mode, the user may enjoy thekaraoke most conveniently in the embodiment of FIG. 4 where themicrophones 22L, 22R are used as vocal microphones in the earphonedevice 2 equipped with the NC function. Further, when the audio player 1is configured to be connected to the external device or to use othermicrophones as shown in FIG. 2 or 3, the user may enjoy the karaoke moresatisfactorily. In addition, if the microphones 22L, 22R with noisecancel function are used, the audio player 1 may be used for a duet.

4. Processing Examples of DSP

Processing examples of the DSP 16 will be described in detail. FIG. 7Aillustrates a processing example of the NC signal generating unit 16 b.The NC signal generating unit 16 b generates the noise cancel signalsSncL, SncR in the NC mode where the user typically listens to music fromthe audio player 1. The microphone input signals SmL, SmR become voicesignals of external noise voices that are obtained through themicrophones 22L, 22R.

The NC signal generating unit 16 b includes NC filters 41, 43 andinverting amplifiers 42, 44. The NC filters 41, 43 are configured tofunction as, for example, high-rejection filters. The NC signalgenerating unit 16 b generates the noise cancel signals SncL, SncR byusing the NC filters 41, 43 to filter the microphone input signals SmL,SmR, respectively, and using the inverting amplifiers 42, 44 to invertthe phases of the filtered signals. By adding the noise cancel signalsSncL, SncR to the digital audio signals DaL′, DaR′, the user equippedwith the earphone device 2 may be provided with music with less noise,i.e., with external noise spatially erased.

FIG. 7B illustrates a processing example of the vocal processing unit 16c. The vocal processing unit 16 c generates the vocal signals SvL, SvRwhen the audio player 1 is operated in the karaoke mode. The microphoneinput signals SmL, SmR become voice signals of the voice of the singinguser which are obtained through the microphones 22L, 22R or the othermicrophones 3M, 3S.

The vocal processing unit 16 c includes an adder 51 and an echoprocessor 52. Specifically, the vocal processing unit 16 c is configuredto use the adder 51 to add the microphone input signals SmL, SmR and touse the echo processor 52 to perform echo process on the added signals.After the echo process is completed, the vocal processing unit 16 cdivides the echo-processed signals into L- and R-channel vocal signalsSvL, SvR. By the echo process thus performed by the vocal processingunit 16 c, it is possible to output a vocal sound which is the singingvoice with the echoing effect. In the present embodiment, the additionof L- and R-channels is followed by the echo process. However, it shouldbe understood that the echo process may be individually performed on themicrophone input signals SmL, SmR.

FIG. 8 illustrates processes of the karaoke signal generating unit 16 dand the vocal processing unit 16 c in the karaoke mode. The karaokesignal generating unit 16 d performs a vocal cancel process through anadder 61, a voice band-pass filter 62, and subtractors 63, 64. Thedigital audio signals DaL′, DaR′ are added by the adder 61 and suppliedto the voice band-pass filter 62. The voice band-pass filter 62 passes avoice band (e.g., 300 Hz˜3 kHz). Signal components of the voice band aresupplied to the subtractors 63, 64. The subtractor 63 subtracts a voiceband of signal component from the digital audio signal DaL′. Thesubtractor 64 subtracts a voice band of signal component from thedigital audio signal DaR′. As such, the karaoke signals SkL, SkR withreduced vocal voices are generated from the digital audio signals DaL′,DaR′ of the music content.

The vocal processing unit 16 c performs an echo process. In thisembodiment, the vocal processing unit 16 c includes reverb processors71-74 and adders 75, 76. Specifically, the microphone input signal SmLwith an echoing component added by the reverb processor 71 is suppliedto the adder 75 and, at the same time, an echoing component generated bythe reverb processor 73 is supplied to the adder 76. The microphoneinput signal SmR with an echoing component added by the reverb processor72 is supplied to the adder 76 and, at the same time, an echoingcomponent generated by the reverb processor 74 is supplied to the adder75. The adder 75 adds the microphone input signal SmL with the addedecho component and the echo component of the microphone input signal SmRinto the L-channel vocal signal SvL. The adder 76 adds the microphoneinput signal SmR with the added echo component and the echo component ofthe microphone input signal SmL into the R-channel vocal signal SvR.

The adders 16 eL, 16 eR of the combination unit 16 e adds the karaokesignals SkL, SkR and the vocal signals SvL, SvR into the output signalsSsL, SsR of the DSP 16. As such, the user may enjoy the karaoke soundaccompanied by the user's singing voice with rich reverb added.

FIG. 9 also illustrates processes of the karaoke signal generating unit16 d and the vocal processing unit 16 c in the karaoke mode. The karaokesignal generating unit 16 d performs the same process as that shown inFIG. 8. The present embodiment illustrates the vocal processing unit 16c that performs an anti-howling process in addition to the echo (reverb)process.

The vocal processing unit 16 c includes an adder 81, a reverb processor82, a band-limiting filter 83, phase shifters 84 a-84 d, and a selector85. The vocal processing unit 16 c controls the adder 81 to add themicrophone input signals SmL, SmR and controls the reverb processor 82to add an echoing component to the added signals. The signal from thereverb processor 82 is limited by the band-limiting filter 83. Forexample, the band-limiting filter 83 passes a voice band (i.e., 300 Hz˜3kHz). The signal of the voice band is supplied to the phase shifters84-84 d.

The phase shifters 84 a-84 d shift phases of the input signal by +90°,0°, −90° and 180°, respectively. Actually, the phase shifter 84 bshifting a phase of 0° may be implemented by a non-inverting amplifierwith a gain of 1, and the phase shifter 84 d shifting a phase of 180°may be implemented by an inverting amplifier with a gain of 1. Further,the phase shifters 84 a, 84 c shifting phases of +90° and −90°,respectively, may be implemented by Hilbert transform filters. Theselector 85 selects the output of any one of the phase shifters 84 a-84d, divides the selected output into the L- and R-channel vocal signalsSvL, SvR, and supplies the L- and R-channel vocal signals SvL, SvR tothe adders 16 eL, 16 eR of the combination unit 16 e, respectively. Theselection of the selector 85 is changed according to the user'soperation. The combination unit 16 e adds the vocal signals SvL, SvR andthe karaoke signals SkL, SkR, respectively, into the output signals SsL,SsR.

The process performed as shown in FIG. 9 may suppress the howling. Forexample, if a user as a singer recognizes a howling sound, the user mayoperate the operating unit 12 of the audio player 1 to select aphase-shift mode. That is, the selection of the selector 85 is randomlychanged. By finding the selection condition to remove the howling andchanging the phases of the vocal signals SvL, SvR, it is possible tomake it difficult for the howling to occur.

FIGS. 10 and 11 illustrate a process of the DSP 16 which is verysuitable for a user to enjoy karaoke in a self-contained manner usingthe earphone device 2 as illustrated in FIG. 4. Assuming that the userwears the earphone device 2 to use the microphones 22L, 22R as vocalmicrophones and listen to vocal and karaoke sound through the speakers21L, 21R as described with reference to FIG. 4, the signal flow is shownin FIG. 10.

The digital audio signals DaL, DaR of music content replayed by thereplay unit 10 are processed by the audio processing unit 16 a and thekaraoke signal generating unit 16 d in the DSP 16 into the karaokesignals SkL, SkR. The karaoke signals SkL, SkR are then supplied to thecombination unit 16 e. The user's singing voice is collected by themicrophones 22L, 22R and is input as the microphone input signals SmL,SmR to the DSP 16 through the microphone input unit 17. The vocalprocessing unit 16 c performs the following beamforming process on themicrophone input signals SmL, SmR to generate the vocal signals SvL,SvR. The vocal signals SvL, SvR are then supplied to the combinationunit 16 e. The combination unit 16 e adds the vocal signals SvL, SvR andthe karaoke signals SkL, SkR, respectively, into the output signals SsL,SsR. The output signals SsL, SsR are converted into analog signals andpower-up amplified by the earphone output unit 18 and presented to theuser as a combination of the karaoke sound and the singing voice throughthe speakers 21L, 21R.

FIG. 11 illustrates processes of the vocal processing unit 16 c and thekaraoke signal generating unit 16 d in the DSP 16 which are suitable forthe forgoing situation. The karaoke signal generating unit 16 d performsthe same vocal cancel process as that shown in FIG. 8. In thisembodiment, the vocal processing unit 16 c includes a beamformingprocessor 91 and a reverb processor 92. In this case, since the userwears the earphone device 2, the microphones 22L, 22R are placed nearthe user's ears rather than the user's mouth. In this case, a singer'svoice may be certainly collected by performing the beamforming process.In other words, the beamforming technique enables the sound to becollected with directivity.

In this case, two microphones (stereo microphones) are generally used.If forward or backward directivity is desired, the simplest beamformingprocess may be performed by the addition of voice signals from left andright microphones. In this case, since left- and right-channel voicesignal components of voice from a sound source located at equaldistances from the microphones are in phase, they are boosted by theaddition of the voice signal components. However, since voice signalcomponents of voice from a sound source at a different direction are outof phase, they are reduced by that much. As such, for example, a voicesignal with a forward directivity may be obtained. The two microphones22L, 22R installed in the speaker units 2L, 2R of the earphone device 2are located at almost equal distances from the user's mouth.Accordingly, only by the addition of the left and right microphone inputsignals SmL, SmR of the beamforming processor 91, it is possible toextract the user's singing voice despite noises. That is, thebeamforming process enables the user's singing voice to be correctlycollected with directivity and the noises to be reduced at the sametime. Further, the beamforming may be focused in directions other thanthe forward direction. In this case, by providing a delay device on onechannel, it is possible to absorb the time difference of equalwavefronts reaching the microphones. Hence, beamforming may be formed ininclined or transverse direction. Accordingly, delay processing may beperformed depending upon a positional relation between the microphones22L, 22R and the user's mouth when the user wears the earphone device 2.

Further, in order to obtain a more precise beamforming (in this case, ahigher directivity of the microphones 22L, 22R to the user's mouth and areduced noise), a noise suppression device using a band-pass filter maybe used.

FIG. 12 illustrates a structure (noise suppression processor) to be usedas the beamforming processor 91 of FIG. 11. As shown in FIG. 12, thebeamforming processor (noise suppression processor) 91 includes a soundsource direction determination unit 100A and a filter processor 100B. Inthis embodiment, the sound source direction determination unit 100Adetermines sound source directions at each of the first to third bandsfor the L/R channel microphone input signals SmL, SmR. The filterprocessor 100B includes three series-connected filters (MPFs: midpresence filters) 158, 159, 160 to boost or attenuate the voice signalsat the first to third bands.

The sound source determination unit 100A includes band-pass filters151L, 152L, 153L, 151R, 152R, 153R and sound source direction angleanalysis units 154, 155, 156. The band-pass filters 151L, 152L, 153Lhave central pass frequencies fc1, fc2, fc3, respectively. Forconvenience, the pass bands are denoted by BD1, BD2, BD3, respectively.The band-pass filters 151R, 152R, 153R have central pass frequenciesfc1, fc2, fc3, respectively. Likewise, the pass bands are denoted byBD1, BD2, BD3, respectively. The left-channel microphone input signalSmL is input to the band-pass filters 151L, 152L, 153L, therebyextracting voice signal components of the bands BD1, BD2, BD3. Theright-channel microphone input signal SmR is input to the band-passfilters 151R, 152R, 153R, thereby extracting voice signal components ofthe bands BD1, BD2, BD3.

The voice signal components of the band BD1 of the left- andright-channels, which are the outputs of the band-pass filters 151L,151R, are supplied to the sound source direction angle analysis unit154. The voice signal components of the band BD2 of the left- andright-channels, which are the outputs of the band-pass filters 152L,152R, are supplied to the sound source direction angle analysis unit155. The voice signal components of the band BD3 of the left- andright-channels, which are the outputs of the band-pass filters 153L,153R, are supplied to the sound source direction angle analysis unit156.

The sound source direction angle analysis unit 154 determines a soundsource direction of a dominant sound among the voice signal componentsof the band BD1. The sound source direction angle analysis unit 155determines a sound source direction of a dominant sound among the voicesignal components of the band BD2. The sound source direction angleanalysis unit 156 determines a sound source direction of a dominantsound among the voice signal components of the band BD3. Each of thesound source direction angle analysis units 154, 155, 156 determines thesound source direction at its corresponding band based on the energydifference of voice signals on each channel. The sound source directionangle analysis units 154, 155, 156 control the MPFs 158, 159, 160, whichcorrespond one-to-one to each other by the control signals SG1, SG2,SG3, respectively, according to the determined directions. As can beseen from FIG. 12, the sound source direction angle analysis unit 154controls the MPF 158; the sound source direction angle analysis unit 155controls the MPF 159; and the sound source direction angle analysis unit156 controls the MPF 160.

The filter processor 100B includes an adder 157 and MPFs 158, 159, 160.The MPFs 158, 159, 160 are a group of series-connected filters. Theadder 157 adds the left- and right-channel microphone input signals SmL,SmR. The voice signal (LR added signal), which is a combination of theleft- and right-channel microphone input signals that are added by theadder 157, is supplied to the MPF 158.

The MPFs 158, 159, 160 boost or attenuate their corresponding bands.Here, the three MPFs are provided since the band-pass filters 151L,152L, 153L, 151R, 152R, 153R of the sound source direction determinationunit 100A divide the microphone input signals SmL, SmR into three bands.The MPFs 158, 159, 160 have central frequencies fc1, fc2, fc3,respectively. Each of the MPFs 158, 159, 160 has filter characteristicsshown in FIG. 13. Each of the MPFs 158, 159, 160 is configured toamplify or reduce the gain with respect to a band of interest (a bandwith a central frequency of fc). As described above, the boost orattenuation of the band of interest by the gain adjustment, which isperformed in the MPFs 158, 159, 160, is controlled by the sound sourcedirection angle analysis units 154, 155, 156.

Specifically, while the MPF 158 boosts or attenuates the band BD1 with acentral frequency of fc1, the MPF 158 corresponds to the band-passfilters 151L, 151R and the sound source direction angle analysis unit154. Further, while the MPF 159 boosts or attenuates the band BD2 with acentral frequency of fc2, the MPF 159 corresponds to the band-passfilters 152L, 152R and the sound source direction angle analysis unit155. Further, while the MPF 160 boosts or attenuates the band BD3 with acentral frequency of fc3, the MPF 160 corresponds to the band-passfilters 153L, 153R and the sound source direction angle analysis unit156.

If beamforming is performed towards the user's mouth when seen from themicrophones 22L, 22R, a band where the direction of a sound source isdetermined as a target direction is boosted, while a band where thedirection of a sound source is determined as a different direction thanthe target direction is attenuated. The level of boost or attenuationvaries depending upon the determination of direction angle.

The MPFs 158, 159, 160 boost or attenuate the added microphone inputsignals SmL, SmR under the control of the sound source direction angleanalysis units 154, 155, 156. The output of the MPF 160 becomes theoutput signal Sout of the beamforming processor 91. As a result, theoutput of the beamforming processor 91 is a signal which is obtained bycorrectly collecting the user's singing voice (the sound around theuser's mouth) with reduced noises.

As shown in FIG. 11, the echoing component is added by the reverbprocessor 92 to the output signal of the beamforming processor 91. Theoutput of the reverb processor 92 is divided into the L- and R-channelvocal signals SvL, SvR, which are supplied to the adders 16 eL, 16 eR ofthe combination unit 16 e. The combination unit 16 e adds the vocalsignals SvL, SvR and the karaoke signals SkL, SkR, respectively, intothe output signals SsL, SsR. Accordingly, by the forgoing processes, itis possible to provide the user with high quality karaoke and vocalsound when the user enjoys karaoke in a self-contained manner.

Modified Examples

Although preferred embodiments of the present disclosure are describedin detail with reference to the appended drawings, the presenttechnology is not limited thereto. It should be understood that variousmodifications may be provided. In the preferred embodiments, themicrophone input unit 17, the DSP 16, the earphone output unit 18, andthe control unit 11 (the control unit 11 configured to control the DSP16) are installed in the audio player 1 to perform the NC mode and thekaraoke mode. On the other hand, as shown in FIG. 14, a noise cancelunit 8 may be provided separately from the audio player 1 and, forexample, installed in the middle of the earphone device 2. In this case,the noise cancel unit 8 may be configured as a signal processing devicewhich includes elements corresponding to the microphone input unit 17,the DSP 16, the earphone output unit 18 and the control unit 11 (thecontrol unit 11 configured to control the DSP 16) so that the user mayselectively enjoy the music and the karaoke. In other words, a signalprocessing device configured to implement the NC mode and the karaokemode may be provided separately from a replay apparatus, such as theaudio player 1.

Further, in the preferred embodiments, the microphones 22L, 22R (orother microphones) may be used for a duet. In this case, the vocalprocessing unit 16 c may be configured to individually perform such asound process as an echo process on each of the microphone input signalsSmL, SmR.

The vocal processing unit 16 c may be configured to perform other soundprocesses, such as vocal boost process, voice change process, harmonyadding process, or vocal level adjustment, than the forgoing soundprocesses. Examples of the vocal boost process may include equalizationfor boosting vocal bands or addition of harmonics components to vocalcomponents. An example of the voice change process may include changingfrequency characteristics of signals. An example of the harmony addingprocess may include operations of extracting a vocal signal,pitch-shifting the extracted vocal signal, and adding the shifted vocalsignal to the vocal signal.

In addition, the karaoke signal generating unit 16 d may be configuredto perform key adjustment (pitch shift). Specifically, by performingpitch shift on a karaoke signal of music, a user may adjust a key of themusic to his/her desired key.

Further, when the output signals SsL, SsR of the DSP 16 are transmittedto an external device through the external communication unit 15, theoutput signals may be recorded on a recording device without outputtingthe output signals from speakers of the external device.

Further, digital microphones may be used as the microphones 3M, 3S orthe microphones 22L, 22R. In this case, the microphone input unit 17 maynot include the microphone amplifiers 32L, 32R and the A/D converters31L, 31R. Accordingly, the microphone input unit 17 may be configured asan input interface from the digital microphones, or the DSP 16 may beconfigured to be equipped with the function of the microphone input unit17.

Additionally, the present technology may also be configured as below.

(1) A replay apparatus including:

a music source unit configured to output a digital audio signal ofmusic;

a microphone signal input unit configured to input a voice signalsupplied from a microphone;

a noise cancel signal generating unit configured to generate a noisecancel signal having a signal characteristic to cancel an external noisecomponent based on the voice signal input by the microphone signal inputunit;

a vocal processing unit configured to perform a sound process for vocalvoice on the voice signal input by the microphone signal input unit togenerate a vocal signal;

a karaoke signal generating unit configured to generate a karaoke signalby canceling a vocal voice component from the digital audio signalsupplied from the music source unit;

a combination unit configured to perform a first combining process wherethe digital audio signal supplied from the music source unit and thenoise cancel signal are combined, and a second combining process wherethe karaoke signal and the vocal signal are combined;

a control unit configured to control the combination unit to perform thefirst combining process in a first mode and to perform the secondcombining process in a second mode; and

an output unit configured to output the combined signals combined by thecombination unit as a voice signal to be output from a speaker.

(2) The replay apparatus according to (1),

wherein the noise cancel signal generating unit, the vocal processingunit, the karaoke signal generating unit and the combination unit areinstalled as software processing functions in an operation processingdevice, and

wherein the control unit controls the operation processing device toexecute the noise cancel signal generating unit and to execute thecombination unit to perform the first combining process in the firstmode, and the control unit controls the operation processing device toexecute the vocal processing unit and the karaoke signal generating unitand to execute the combination unit to perform the second combiningprocess in the second mode.

(3) The replay apparatus according to (1) or (2), wherein the vocalprocessing unit performs a beamforming process for the sound process forvocal voice.(4) The replay apparatus according to any one of (1) to (3), wherein thevocal processing unit performs a reverb process for the sound processfor vocal voice.(5) The replay apparatus according to any one of (1) to (4), wherein thevocal processing unit performs an anti-howling process for the soundprocess for vocal voice.(6) The replay apparatus according to any one of (1) to (5), wherein thekaraoke signal generating unit generates a karaoke signal with no vocalvoice component by extracting the vocal voice component from the digitalaudio signal supplied from the music source unit and subtracting thevocal voice component from the digital audio signal.(7) The replay apparatus according to any one of (1) to (6), wherein themicrophone signal input unit is configured to input a voice signalsupplied from a microphone installed in a case of a connected earphone.(8) The replay apparatus according to any one of (1) to (7), furtherincluding a display control unit outputting, as display data, lyricsdata corresponding to the digital audio signal output from the musicsource unit.

Although preferred embodiments of the present disclosure are describedin detail with reference to the appended drawings, the presenttechnology is not limited thereto. It should be understood by thoseskilled in the art that various modifications, combinations,sub-combinations and alterations may occur depending on designrequirements and other factors insofar as they are within the scope ofthe appended claims or the equivalents thereof.

The present disclosure contains subject matter related to that disclosedin Japanese Priority Patent Application JP 2011-125949 filed in theJapan Patent Office on Jun. 6, 2011, the entire content of which ishereby incorporated by reference.

1. A replay apparatus comprising: a music source unit configured tooutput a digital audio signal of music; a microphone signal input unitconfigured to input a voice signal supplied from a microphone; a noisecancel signal generating unit configured to generate a noise cancelsignal having a signal characteristic to cancel an external noisecomponent based on the voice signal input by the microphone signal inputunit; a vocal processing unit configured to perform a sound process forvocal voice on the voice signal input by the microphone signal inputunit to generate a vocal signal; a karaoke signal generating unitconfigured to generate a karaoke signal by canceling a vocal voicecomponent from the digital audio signal supplied from the music sourceunit; a combination unit configured to perform a first combining processwhere the digital audio signal supplied from the music source unit andthe noise cancel signal are combined, and a second combining processwhere the karaoke signal and the vocal signal are combined; a controlunit configured to control the combination unit to perform the firstcombining process in a first mode and to perform the second combiningprocess in a second mode; and an output unit configured to output thecombined signals combined by the combination unit as a voice signal tobe output from a speaker.
 2. The replay apparatus according to claim 1,wherein the noise cancel signal generating unit, the vocal processingunit, the karaoke signal generating unit and the combination unit areinstalled as software processing functions in an operation processingdevice, and wherein the control unit controls the operation processingdevice to execute the noise cancel signal generating unit and to executethe combination unit to perform the first combining process in the firstmode, and the control unit controls the operation processing device toexecute the vocal processing unit and the karaoke signal generating unitand to execute the combination unit to perform the second combiningprocess in the second mode.
 3. The replay apparatus according to claim1, wherein the vocal processing unit performs a beamforming process forthe sound process for vocal voice.
 4. The replay apparatus according toclaim 1, wherein the vocal processing unit performs a reverb process forthe sound process for vocal voice.
 5. The replay apparatus according toclaim 1, wherein the vocal processing unit performs an anti-howlingprocess for the sound process for vocal voice.
 6. The replay apparatusaccording to claim 1, wherein the karaoke signal generating unitgenerates a karaoke signal with no vocal voice component by extractingthe vocal voice component from the digital audio signal supplied fromthe music source unit and subtracting the vocal voice component from thedigital audio signal.
 7. The replay apparatus according to claim 1,wherein the microphone signal input unit is configured to input a voicesignal supplied from a microphone installed in a case of a connectedearphone.
 8. The replay apparatus according to claim 1, furthercomprising a display control unit outputting, as display data, lyricsdata corresponding to the digital audio signal output from the musicsource unit.
 9. A signal processing apparatus comprising: a microphonesignal input unit configured to input a voice signal supplied from amicrophone; a noise cancel signal generating unit configured to generatea noise cancel signal having a signal characteristic to cancel anexternal noise component based on the voice signal input by themicrophone signal input unit; a vocal processing unit configured toperform a sound process for vocal voice on the voice signal input by themicrophone signal input unit to generate a vocal signal; a karaokesignal generating unit configured to generate a karaoke signal bycanceling a vocal voice component from the digital audio signal of inputmusic; a combination unit configured to perform a first combiningprocess where the input digital audio signal and the noise cancel signalare combined, and a second combining process where the karaoke signaland the vocal signal are combined; a control unit configured to controlthe combination unit to perform the first combining process in a firstmode and to perform the second combining process in a second mode; andan output unit configured to output the combined signals combined by thecombination unit as a voice signal to be output from a speaker.
 10. Amethod of selectively performing signal processing in a first mode andin a second mode, wherein in the first mode, a noise cancel signalhaving a signal characteristic to cancel an external noise component isgenerated based on a voice signal supplied from a microphone, and aninput digital audio signal and the noise cancel signal are combined intoa voice signal to be output through a speaker; and in the second mode, asound process for vocal voice is performed on a voice signal suppliedfrom a microphone, a vocal voice component is canceled from a digitalaudio signal of input music to generate a karaoke signal, and thekaraoke signal and the vocal signal are combined into a voice signal tobe output through a speaker.